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Abstract 

We present a new combinatorial algorithm for triangle finding and Boolean matrix multiplication that runs in 
0(n^/ log^ n) time, where the O notation suppresses poly(loglog) factors. This improves the previous best combi¬ 
natorial algorithm by Chan II that runs in 0(n®/log®n) time. Our algorithm generalizes the divide-and-conquer 
strategy of Chan’s algorithm. 

Moreover, we propose a general framework for detecting triangles in graphs and computing Boolean matrix 
multiplication. Roughly speaking, if we can find the “easy parts” of a given instance efficiently, we can solve the 
whole problem faster than n®. 


1 Introduction 

Boolean matrix multiplication (BMM) is one of the most fundamental problems in computer science. It has many 
applications to triangle finding, transitive closure, context-free grammar parsing, etc Q, Eol, a, M- One way to 
multiply two Boolean matrices is to treat them as integer matrices, and apply a fast matrix multiplication algorithm 
over the integers. Matrix multiplication over fields can be computed in “truly subcubic time”, i.e., computing the 
product of two n X n matrices can be done in 0(n^~^) additions and multiplication over the field. For example, the 
latest generation of such algorithms run in operations ifT^ . ||9l . These algorithms are “algebraic”, as they 

rely on the structure of the field, and in general the ring structure of matrices over the field. 

There is a different group of BMM algorithms, often called “combinatorial” algorithms. They usually reduce the 
redundancy in computation by exploiting some combinatorial structure in the Boolean matrices. The “Four Russians” 
algorithm by Arlazarov, Dinic, Kronrod, and Faradzhev III is the most well-known combinatorial algorithm for BMM. 
On the RAM model with word size w = 0(logn), “Four Russians” algorithm can be implemented in 0{n^/ log^ n) 
time. About 40 years later, this result was improved by Bansal and Williams. In their FOCS’09 Ill paper, they 
presented an 0(n^(loglogn)^/log®^"' n) time combinatorial algorithm for Boolean matrix multiplication, using the 
weak regularity lemma for graphs. Recently, Chan presented an 0(n^ (log log n)'’ / log^ n) time algorithm in his 
SODA 15 paper a, improving the running time even further. 

Although these combinatorial algorithms have worse running times than the algebraic ones, they generally have 
some nice properties. Combinatorial algorithms usually can be generalized in ways that the algebraic ones cannot be. 
For example, Chan’s algorithm partly extends an idea of divide-and-conquer in an algorithm for the offline dominance 
range reporting problem by Impagliazzo, Lovett, Paturi, and Schneider IS]; the algebraic structure of dominance 
reporting is completely different from BMM’s. Moreover, in practice, these combinatorial algorithms are usually fast 
and easy to implement, while in contrast, most theoretically fast matrix multiplication algorithms are impractical to 
implement. Finding a matrix multiplication algorithm that is both “good” in theory and practice is still an important 
open goal of the area. 

^Supported in part by NSF CCF-1212372. 
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In this paper, we generalize the ideas of Impagliazzo et al. IHl and Chan Q, to present a faster combinatorial 
algorithm for triangle detection: given an n-node graph, does it contain a triangle? 

Theorem 1 . Given a tripartite graph G on n vertices, we can detect if there is a triangle in G using a combinatorial 
algorithm in O {v?/ log'^ n) time on a word RAM with word size w > fl(log n). 0 

Vassilevska Williams and Williams na proved that triangle detection and Boolean matrix multiplication are “sub- 
cubic equivalent” in the following sense: if there is a 0(n^/g{n)) time algorithm for triangle detection on n-node 
graphs, then we can use it to solve BMM on n x n matrices in 0(n^/p(n^/^)) time. Together with Theorem[T] this 
gives a fast combinatorial algorithm for Boolean matrix multiplication. 

Theorem 2. There is a combinatorial algorithm to multiply two n x n Boolean matrices in O (n^/ log^^ n) time. 

Moreover, we generalize the algorithm, and propose a general framework for solving triangle detection combina- 
torially. 

Definition 1 . The large subgraph triangle detection problem with parameters ol, / 3 , and 7 is: given a tripartite graph 
G = {AU B U G, E), output a pair (G', b), where 

• G' is a subgraph with at least an a-fraction of vertices from A, f3-fraction of vertices from B, and y-fraction of 
vertices from G, and 

• b = 1 if G' is triangle-free, and b = 0 if G' contains a triangle. 

That is, the large subgraph triangle detection problem is to identify a large subgraph G' of G for which we can 
conclude whether G" is triangle-free or not. This problem is interesting when we can solve it quickly - faster than what 
is known for standard triangle detection. Additionally, we can show that fast algorithms for large subgraph triangle 
detection imply fast algorithms for triangle detection in general: 

Theorem 3. Let n be an integer, 0 < a, / 3 , 7 , c < 1, and G be any tripartite graph on vertex sets A, B, G with at least 
y/n vertices in each part. If there is an algorithm Lfor large subgraph triangle detection on every such G that runs 
in 0 (ca/ 37 | A| |i3| |G|) time, then we can solve triangle detection on n-node graphs in 0{cn^ -\- time for any 

e > 0 such that a/3 > 10e(l -f log i) and a > 10e(l -I- log j). 

That is, to derive an efficient algorithm for triangle finding, it is sufficient to find and solve an “easy part” of the 
input. This opens a new direction for attacking this problem. 

Related work. In the work by Bansal and Williams ||2|, they used the weak regularity lemma of Frieze and Kan- 
nan Jh) to discover and exploit small substructures in the graph. Generally speaking, a regularity lemma partitions 
the vertex set of a graph into disjoint sets, so that the edge distribution between any two sets is “close to random.” 
Bansal and Williams enumerate every triple of sets in the partition: if the subgraph induced by the triple is sparse, 
hnding a triangle in this triple is easy. Otherwise, since the induced subgraph is dense and “close to random”, the 
regularity lemma guarantees that it is impossible to check many pairs of vertices without hnding an edge between 
them. Integrating the method of Four Russians with the above approach yields an 0{n^ / log®^"^ n) time algorithm for 
triangle detection. 

Chan Q used a very different approach for triangle detection which we now outline briehjQ. Consider a tripartite 
graph on vertex sets {A, B, G) such that |A| < polylog(|i3| -f |G|) (if this is not the case, partition the set A into 
polylog(|i?| -f |G|)-size subsets, and solve them independently). If the edge set between A and B is sparse, triangle 
detection is easy. Otherwise, there is some node v € A with many neighbors in B. Then the algorithm manually 
checks every pair of neighbors of v and does two recursive calls. One is on A \ {u}, B and the non-neighbors of v in 
G, and the other is on A \ {u}, non-neighbors of u in i? and neighbors in G. On one hand, this recursive procedure 
never puts any pairs of neighbors of v in the branch. This guarantees that the algorithm only manually checks every 

* We use 0{f{n)) to suppress poly(log log f{n)) factors in the running time. 

^To keep consistency, the following description will be in the language of triangle finding, although his paper originally presented the algorithm 
in the language of Boolean matrix multiplication. 


2 



pair in i? X C at most once. On the other hand, the procedure may copy the set of non-neighbors in B when doing 
a recursive call, which increases the total input size. However, since we have a lower bound on the degree of vertex 
V to B, the procedure does not copy too many vertices each time. A careful analysis shows that the overhead of the 
recursion is actually rather tiny. 

In Section |2l we show how to extend the idea of divide-and-conquer in Chan’s algorithm to get an even faster 
algorithm for triangle detection (and hence for Boolean matrix multiplication). We give a more intuitive proof of the 
recursion involving less calculation. In Section[2 we propose a general framework for solving triangle detection, as a 
starting point for future work in this area. 

2 Triangle Detection 

PreUminaries and notations. We shall use the fact that the triangle detection problem in general undirected graphs 
is time-equivalent to the problem restricted to tripartite graphs, up to a constant factor. (The proof is straightforward.) 
Henceforth, we will assume the input graph is tripartite, and the tri-partition of its vertices is given to us. 

For a graph G = {V, E), a vertex v G V and a subset of vertices S C V, we denote d{v, S') = |i? fl ({u} x S)|, 
which we call the degree ofv to S. 

Main results. In what follows, we present an 0(n^/ log'* n) time combinatorial algorithm for triangle detection. 

Suppose we are given a tripartite graph G on vertex sets A, B, G. One (naive) approach to detect if there is a 
triangle in the graph is; for vertex v G A, and all pairs (rt, w) of v’s neighbors, check if there is an edge between u 
and w. The amount of work we do for vertex v is proportional to the number of edges between v and B (the degree 
of V to B) times the number of edges between v and G (the degree of v to C). This approach is efficient whenever 
the product of these two degrees is low on average. However, if this is not the case, there must be a vertex v G A 
with a large product of degrees. If the enumeration reports no edge between any pair of u’s neighbors, we know there 
has to be a large “non-edge area” between B and G, i.e. between u’s neighbors. In the rest of the algorithm, there is 
no need to look again at any pair of vertices in that area. We implement this idea by recursion; find disjoint subsets 
of B X G which together cover all pairs outside the non-edge area, and recurse on them. We show this recursion is 
actually efficient. 

Before stating and proving the efficiency of our main algorithm, we hrst show that the naive approach proposed 
above is “Four-Russianizable” as expected. That is, we can apply the Method of Four Russians to speed up the sparse 
case by a factor of roughly log^ n. 

The following algorithm is a generalization of an algorithm of Bansal and Williams ©• 

Lemma 1. Let G = {V, E) be tripartite on vertex sets A, B, G of sizes k, m, n respectively. If d{vi, B)d{vi, G) < 
holds for some A and all Vi G A simultaneously, then we can detect if there is a triangle in G combinatorially in 
0{mnA^^ -f + k{m + n)) time, on word RAM with word size w > fl{A log A + log kmn). 

Proof First, partition the vertices in i?(resp. G) into groups {f3i}(resp. {Gi}) of sizes A^ arbitrarily, e.g. put 
(iA^ + l)-th to (i + l)A^-th vertex of i3(resp. G) in i?i(resp. Gi). Let Sb = {*5 : |<S'| < A, S' C Bi for some Bi}, 
Sc = {S' ; [S'! < A, S C Gi for some Gi} be collections of subsets within the same group of i? or C with at most A 
vertices. For every S G Sb, S' G Sc, we determine if there is at least one edge between them, and store all the results 
in a lookup table. This preprocessing takes 

time. Note that we can index a subset using 0{A log A -f log max{TO, n}) = 0{w) bits. This table can be stored in 
the memory so that one table lookup takes constant time. 

With the help of this table, we can check if there is a triangle in G efficiently. We go over all vertices Vi G A, 
and partition its neighborhood into a minimum number of sets in Sb,Sc. That is, for every group of vertices, we 
arbitrarily partition vfs neighborhood in this group into sets of size exactly A and (possibly) one more set of size at 
most A. This generates at most sets from Sb and at most sets from Sc. Using the lookup 
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table, we can detect if there is an edge between any pair of sets from Sb and Sc in constant time. Going over all 
Vi & A takes 



mn 



-k{m + n) 


< O 


time. The total running time is at most O + k{m + n)) as we stated. 

□ 


□ 


Using this algorithm for the sparse case as a subroutine, we give a fast combinatorial algorithm for triangle detec¬ 
tion. 

Theorem 1. Given a tripartite graph G on n vertices, we can detect if there is a triangle in G using a combinatorial 
algorithm in O (jA/ log'^ n) time on a word RAM with word size w > fl(log n). 

Proof. Set parameter A = ioo(/ogiogn)^ ' remain hxed as we do the recursion, even if the instance size shrinks. 

The following algorithm detects if there is a triangle in a tripartite graph with vertex sets A, B,C: 


Step 0; If |i?| < A® or |(7| < A®, solve the instance by exhaustive search and return the answer. 

Step 1; If for all vertices Vi € A, d(vi, B)d(vi, G) < we solve the instance by the algorithm in 

Lemma[T] 

Step 2: Otherwise, hnd a vertex that violates the condition. Without loss of generality, assume vi does, 

d(vi,B)d(vi,G) > 

Step 3: Let i3i(resp. Gi) be vfs neighborhood in i3(resp. G). 

If > -1^, then recurse on {A \ {z;i}, B,C \ Gi) and [A \ B \ Bi, Gi), else recurse on 
{A \ {ui}, B\BuC) and {A \ {ui}, B^,C \ Ci). 

Return YES if either of the two recursions returned YES. 

Step 4: Check all pairs of vertices in Bi x Ci for an edge. 

Return YES if there is an edge, NO otherwise. 


Analysis of the running time. 

In each node of the recursion tree, only some of the following four subprocedures are executed; 

1. If either \B\ or \G\ is small, we do exhaustive search, which takes OdAllfJlICI) time. 

2. We spend O (|A|(|i3| + |C|)) time to check whether there is a high degree vertex and generate the inputs for 
two recursive calls. 

3. If A has no high degree nodes, we invoke the algorithm in Lemma[T] which takes 

O (^\B\\G\A^^ + + |A|(|B| + |C|)) 

= O (|i?||C|n°W + |A||i?||C|/log"n+ |A|(|i?| + |C|)) 

time. 
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4. For every pair of neighbors of vi in B and C, we check if they have an edge. This step takes 0( | Bi 11 Ci |) time. 

To analyse the total running time, we are going to bound the time we spend on the small-graph case (Subprocedure 
1) and the time we spend on the large-graph case (Subprocedure 2,3,4) in the entire execution of the algorithm (the 
whole recursion tree) separately, and sum up these two cases. 

For the case when \B\ > A® and \C\ > A®, Subprocedure 2 is cheap compared to the other steps, taking time 
0(|A|(|i3|-|-|C|)) < O (n\B\\C\/\o^ n). In this case, we mentally charge all the running time to the pairs of vertices 
in B X C, then sum up over all pairs the cost they need to pay. If A has no high degree vertices, we will run Subpro¬ 
cedure 2 and 3, which takes at most 0{\B\\C\n°^^^ + |^||i?||C|/log^n + |^|(|i?| 4- |C|)) < O {n\B\\C\/\o^ n) 
time. We charge this running time to all pairs of vertices in B x C evenly; that is, every pair of vertices gets charged 
O {n/ log^ A has a high degree vertex, we will run Subprocedure 2 and 4, which takes at most 

o{\A\m + \c\) + m\c,\) 

< 6(A2|Bi||Ci|n/log®n+|Bi||Ci|) 

< 6 (n|Bilie'll/log" n) 

time. We charge this running time to the pairs in Bi x Ci evenly, so every pair gets charged O {nj log" n). 

Claim 1. Every pair of vertices is charged at most once, over the entire execution of the algorithm. 

The proof of this claim follows from inspection of the algorithm: the above argument only charges pairs of vertices 
that are not going into the same recursive branch. 

There are nf pairs at the very beginning. Every pair gets charged at most O (nj log" n). Therefore the running 
time for the large-graph case is at most O (n®/ log" n). 

Next we bound the total running time of Subprocedure 1. This running time is proportional to the number of triples 
we enumerated in Step 0. Let T(S') be the maximum possible of this number of triples, if we start our recursion from 
vertex sets A, B, C with IBIICI < S. 

Claim 2. T{S) < nS. Moreover, for S > nA®, then 

T(S) < max {T((l - t')S) + T (f'(l - t)S)} 

t>l/A,i'>l/A2 


Proof T(S) < nS is trivial, since we never enumerate any triple more than once. If S' > nA®, we have \B\, \C\ > 
A®. That is, we must be starting the recursion from a large-graph case. There is nothing to prove if there is no high de¬ 


gree vertex in A, as we will do no enumeration in Step 0. Otherwise, let t = max | |, F = min | |, 

then by the algorithm, we have tt' > 1/A^, in particular, t > 1/A,f' > 1/A^. In the two recursive calls, 
we have ISHCI values (1 — t')S and t'{l — t)S respectively. By the definition of T, we will enumerate at most 
T ((1 — t')S) + T (f'(l — t)S) triples. This proves the claim. 


□ 


□ 


We can upper bound T(S) using this recurrence. Consider the recursion tree TZ for T(S). The root has value 
S. Its left child has value (1 — t')S and right child has value t'(l — t)S, where t and t' maximize T ((1 — t')S) + 
T {t'(l — t)S). We recursively construct the tree for the left child and right child. For a node with value x, we always 
put (1 — t')x in its left child and t'{l — f)x in its right child, for x’s optimal parameter t' and t. We expand the tree 
from nodes with value at least n® ®(> nA®) recursively. Therefore, we will get a tree with leaf values at most n® ®. 
This tree demonstrates how T{S) is computed according to the recurrence, before we reach n® ®. The sum of values 
of all leaves multiplied by n is an upper bound for T{S), since T{x) < nx. 

We calculate this sum in two cases. For every leaf, there is a unique path from the root to it, in which we follow 
the left child in some steps, follow the right child in the rest. Consider all leaves such that the unique path from the 
root to it takes at most lOAlog A(Ri lo ic^g log n ) right-child-moves. Since t' > 1/A^, the depth of the tree is at most 
A^ log n. Therefore, there are at most 


/ A^ log n \ 
VlOAlogAy 


• lOAlogA < < n° " 
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such leaves. By construction, each leaf has value at most so the sum over all leaves is at most 

For those leaves such that the path from the root takes more than lOA log A right-child-moves, consider a tree 
TV with identical structure as 72., but we set the values of nodes in a different way. We set all ts to 0 but leave all 
t's unchanged, then calculate the corresponding values. That is, the root still has value S, its left child still has value 
(1 — t')S, and its right child now has value t'S. For any node with value x in 72', its left child has value (1 — t')x 
and right child has value t'x, for the same ratio t' as the corresponding node in 72, although the value x may have 
changed. Also, leaves now may have values greater than by the construction of 72'. The tree 72' has the following 
two properties; 

1. The sum of values in all leaves is exactly S. 

2. For any node such that the path from root to it takes at least k right-child-moves, its value is at least 1/(1 — 1 / A)^ 
times the value of the corresponding node in 72. 

The first property can be proved by induction, since the sum of values of two children is exactly the value of the 
parent. For the second property, since we require t > 1/A in 72, and keep t' unchanged, set t to 0, we will gain a 
factor of 1/(1 — 7) > 1/(1— 1/A) for every right-child-move. 

By property 2 above, for all leaves that the path from root takes more than lOAlogA right-child-moves, their 
values in 72' must be at least 1/(1 — 1/A)^°^*°S'^ > A^° times the corresponding values in 72. However, the sum of 
these values in 72' is at most S by property 1. Therefore, the sum of values in 72 is at most 5'/A^°. Summing these 
two cases up, we prove that T{S) < nS/ In particular, T{n^) < n^/A^° + ® = 0{n^/ log^° n). 

Finally, we sum up the small-graph and large-graph cases, proving that the algorithm runs in 0{n^/ log^ n) time. 

□ □ 

Remark. In Lemma[T] we can preprocess for all subsets of size at most 0(log fc/loglog fc) instead of 0(A). 
This improves the running time of Theorem[T]from O ^ ) ■ 

Combining the above algorithm with the reduction by Vassilevska Williams and Williams ini, we get an efficient 
combinatorial algorithm for BMM. 

Theorem 4 (Vassilevska Williams and Williams’ 10). For any constant c, if we can solve triangle detection on n-node 
graphs in Olri^ / log° n) time, we can also solve Boolean matrix multiplication on nx n matrices in the same running 
time. 

Theorem 2. There is a combinatorial algorithm to multiply two n x n Boolean matrices in O {vf / log’^ n) time. 


3 A General Approach 

In this section, we propose a more general approach for triangle finding, which may lead to an even faster combinatorial 
algorithm. 

In the algorithm presented in Section |2] finding a high degree vertex v G A and its neighborhood Bi, Ci can be 
viewed as finding a large easy part of the input. That is, for the subgraph induced by vertices A, Bi,Ci, there is a 
2-path for every pair in Bi x Ci. Thus, we only have to spend OdiJiUCil) time to determine if there is a triangle 
in it, which is 0(1/|A|) time on average for every triple of vertices. After solving this part of the input, we do two 
recursive calls which together exactly cover the rest of the triples. The high-degree of v guarantees that the easy part 
we find each time cannot be too small. We will have saved enough time before reaching the case where 77 or O is 
close to constant size (in which we basically have no way to beat the exhaustive search). However, if all vertices have 
low degree, then that instance itself is easy (via Lemma[T]i. The following theorem generalizes this idea of reducing 
the triangle detection problem to finding a large subgraph on which triangle detection is easy. 

Theorem 3. Let n be an integer, 0 < a, /3, 7 , c < 1, and G be any tripartite graph on vertex sets A, 77, C with at least 
y/n vertices in each part. If there is an algorithm Lfor large subgraph triangle detection on every such G that runs 
in 0 (ca/ 37 |A|| 77 ||C'|) time, then we can solve triangle detection on n-node graphs in 0{cn^ -f time for any 

e > 0 such that aP > 10e(l -f log i) and a > 10e(l -f log j). 
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Proof. First note that if e < the theorem is trivial, since In the rest of the proof, we will 

assume e > and thus a/3 > Also, we may assume that G" has sizes exactly a|A|,/3|i3|, 7 |G| when it is 
triangle-free, since otherwise we can simply drop extra vertices from the sets. 

The following divide-and-conquer algorithm detects if there is a triangle among vertices A U i3 U G: 


StepO. If |A||B||G| < f do exhaustive search on all triples and return the answer. 

Step 1. Run L on A U f3 U G. 

Let G' on A' U i3' U C be the subgraph L outputs. 

Return YES if G' contains a triangle. 

Step 2. Recurse on vertex sets (A, B,C \ C), (A, B \ B', C), (A \ A', B', C). 
Return whether any of the three recursive calls returned YES. 


Its correctness is straightforward. We will prove that the algorithm is efficient. 

Eor the large-graph case, a similar “charging argument” works here as in the proof of Theorem [T] Note that the 
input A U B U G we fed to L in Step 1 always has at least ^/ri vertices in each part. By our assumption. Step 1 will run 
in 0 (ca/ 37 |A||i 3 ||G|) < 0(c|A'||i3'||G'|) time. In Step 2, the algorithm generates the input for recursive calls, which 
takes only 0(|A| + \B\ -f |G|) tim^. We charge the running time of Step 1 and Step 2 to the triples in A' x B' x C. 
On average, each triple is charged 0(c) time. Same as the proof of Theorem[T] every triple in A' x B' x C will not 
go into the same recursive branch together. Therefore, every triple is charged at most once in this argument. There are 
nf triples in total. They are charged at most 0{crA) time. This proves that Step 1 and Step 2 take at most 0{crt') time 
in the entire algorithm. 

Eor the small-graph case, the time we spent on Step 0 is proportional to the number of triples we enumerated. Let 
T{S) be the maximum possible value of this number, if we start our recursion with |A||i3||G| = S. On one hand, we 
have T{S) < S, as every triple will be manually checked at most once. Moreover, for S > nf'^, by the way that the 
algorithm does recursive calls, we have T{S) <T{{1 — ^)S)+T ( 7(1 — j3)S) + T (/ 37(1 — a)S). 

We are going to prove T(S') < ( 7 ^) ^ by induction on S. Eor S < n?'^, the new upper bound automatically 

holds, since T{S) < S < ^ Otherwise, by the recurrence and induction hypothesis. 


T{S) < n 


2.5 


(1-7)5' 


n 


2.5 


7(1-/3)5V-^ , //37(l-a)5 


.2.5 


< n 


2.5 


l-e 


\ ^2.5 

((1 - 7)^“" + (7(1 - + (/37(1 - . 

The value of (1 — 7 )^”*^ + ( 7(1 — + (/ 37(1 — is always less than 1; 


(1 - 7)'“' + (7(1 - + (/37(1 - «))'■' 

< (1 — (1 — 6 ) 7 ) -I- 7 ^“*^ (1 — (1 — e)/3) -I- (1 — (1 — e)a) (by (1 — x)‘^ < 1 — cx when 0 < c, a: < 1) 

= 1+7 (-(1 - e) + 7”' (1 + /3 (-(1 - e) + /3“" (1 - (1 - e)a)))) 

< 1 + 7 ^-(1 - e) + 7 “" + /3 ^-(1 - e) + (1 + 2 elogi)(l - (by dog ^ < 1/10 and e < 1/10) 

< 1 + 7 ^-(1 - e) + 7 “" (^1 + /3 |^e + 2elogi - y))) 

^It is dominated by the running time of Step 1. Since to determine if G' has a triangle, L takes at least f2(|A'||i?'|) time. But we have 
\A'\\B'\ > ap\A\\B\ > 10nl'®/logn > fl(n). 
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<1 + 7 


(by elog ^ < 1/10 and e(l + log < a/10) 


(^-(l-e) + (l + 2elogi)(l-^)) 

< 1 + 7 (^e + 2elogi - 

(by e(l + log i) < a/3/10) 

This proves T{S) < in particular, T(n^) < Therefore, summing up the two cases, the 

total running time of the algorithm is at most 0(cv? + 

□ □ 

Remark. Note that we do not have to restrict ourselves to find an a-fraction of A, /3-fraction of B and 7 -fraction 
of C. As long as we can find one part with a-fraction, one with /3-fraction and the third with 7 -fraction, and adjust 
the inputs for recursive calls correspondingly, we will be able to solve triangle detection efficiently. In this sense, the 
algorithm in Section |2] has parameters c = ■^,a = l,/3 = 3,7 = ■^,e = Q( AiogA )’ '''bile G" is the subgraph 
induced by (A, f3i, Gi) or the entire graph if all vertices in A have low degree. 


4 Conclusion 

We have shown how to generalize the idea of divide-and-conquer in Chan’s algorithm, and have provided a more 
intuitive proof of the recursion. The way of analysing the “sublinear” recurrence in Theorem[T] i.e., our T(S) and its 
analysis, should be able to extend to other problems. We would like to see more applications of this method in proving 
the efficiency of other combinatorial algorithms that are based on divide-and-conquer. 

Also, we would hope to have an O(n^) time algorithm for triangle detection on tripartite graphs with vertex set 
sizes n, n, and 0(log'* n). We call this the “lopsided” triangle detection problem, where one side of vertices is very 
small compared to the others. An argument similar to the reduction by Vassilevska Williams and Williams ns shows 
that this would yield an O(n^) time algorithm for multiplying n x n and n x //(log"' n) Boolean matrices, improving 
the maximum outer dimension d that n x n and n x d Boolean matrices can be multiplied in 0{n?) time, in both the 
combinatorial and the algebraic world. The current record of d is 0(log^ n) by Chan’s algorithm (Chan gave an 0{n^) 
time algorithm for multiplying n x d and d x n matrices, which implies an 0{n^ ) time triangle finding algorithm), 
while the record in the algebraic world is merely 0(log n) 13. 

Finally, we provide one type of instance for the lopsided triangle detection problem which seems hard to solve in 
0{v?‘) time with our current techniques: a graph G on vertex sets A, B, C with |A| = |i3| = n and \C\ = //(log"' n), 
with roughly 1 / log n fraction of edges between A and G, 1/ log n fraction of edges between B and G, and constant 
fraction of edges between A and B. The main difficulty is that the size of G is too small. If we try to do recursion, its 
size will reach a constant too soon. Once it becomes of constant size, we basically have no way to save anything from 
exhaustive search. 

Acknowledgement. The author would like to thank Ryan Williams for helpful discussions on results and writing of 
the paper, and the anonymous reviewers for their valuable comments. 
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